Translation Acquisition Using Synonym Sets

نویسندگان

  • Daniel Andrade
  • Masaaki Tsuchida
  • Takashi Onishi
  • Kai Ishikawa
چکیده

We propose a new method for translation acquisition which uses a set of synonyms to acquire translations from comparable corpora. The motivation is that, given a certain query term, it is often possible for a user to specify one or more synonyms. Using the resulting set of query terms has the advantage that we can overcome the problem that a single query term’s context vector does not always reliably represent a terms meaning due to the context vector’s sparsity. Our proposed method uses a weighted average of the synonyms’ context vectors, that is derived by inferring the mean vector of the von Mises-Fisher distribution. We evaluate our method, using the synsets from the cross-lingually aligned Japanese and English WordNet. The experiments show that our proposed method significantly improves translation accuracy when compared to a previous method for smoothing context vectors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Teaching Vocabulary through Synonymous, Semantically Unrelated, and Hyponym Sets on EFL Learners’ Retention

Many textbooks include semantically related words and sometimes teachers add synonyms, antonyms, etc. to the words in order to present new vocabulary items without questioning the possible effects. This study sought to investigate the effect of teaching vocabulary through synonym, semantically unrelated, and hyponym sets based on Higa’s (1963) proposed continuum. A total of 120 Iranian intermed...

متن کامل

Lexical Acquisition with WordNet and the Mikrokosmos Ontology

This paper discusses an approach to augmenting a lexicon for knowledge-based machine translation (KBMT) with information derived from WordNet. The Mikrokosmos project at NMSU's Computing Research Laboratory has concentrated on the creation of the Spanish and Japanese lexicons, so the English lexicon is less developed. We investigated using WordNet as a means to automate portions of the English ...

متن کامل

‘Repetition’ in Arabic-English Translation: The case of Adrift on the Nile

Abstract This study investigates ‘repetition’ in the English translation of the Arabic Novel, Adrift on the Nile (1993). It aims to explore the communicative functions of ‘repetition’ and to see if these functions have been maintained or lost in the process of translating the Novel. In addition, it seeks to find the translation strategies used in rendering ‘repetition’. To achieve this aim, a d...

متن کامل

Automatic Acquisition of Paradigmatic Relations Using Iterated Co-occurrences

We introduce the notion of iterated co-occurrences, which can be obtained by performing the calculation of statistically significant cooccurrences not on sentence level, but on co-occurrence sets of previous calculations. The underlying mechanisms are explained in detail and we give reasons, why this iteration results in sets of semantically homogeneous words. These can be used for the automati...

متن کامل

‘Repetition’ in Arabic-English Translation: The case of Adrift on the Nile

Abstract This study investigates ‘repetition’ in the English translation of the Arabic Novel, Adrift on the Nile (1993). It aims to explore the communicative functions of ‘repetition’ and to see if these functions have been maintained or lost in the process of translating the Novel. In addition, it seeks to find the translation strategies used in rendering ‘repetition’. To achieve this aim, a d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013